feat(sdk): harden pkg/kit embedder surface with scoped additions by ezynda3 · Pull Request #69 · mark3labs/kit

ezynda3 · 2026-06-18T12:38:14Z

Description

Hardens the pkg/kit embedder surface with a set of scoped, purely additive
changes that unblock building higher-level agent frameworks on top of Kit. No
existing exported signature changes — every item is a new field, new exported
symbol, or an interface method with a default implementation on the shipped
managers, so zero-valued defaults preserve current behavior.

This implements 11 of the 14 items from the embedder SDK hardening request
(#68). The remaining three (#2 pluggable provider injection, #7 LLMUsage.Extra,
#13 Kit.Fork) are deferred because they require a breaking change or an
agent-core/architecture shift that warrants maintainer sign-off; they are
tracked on the issue.

Example — per-call overrides without rebuilding a *Kit, plus structured-result
extraction and typed error handling:

// Before: rebuild kit.New() per request to change model/creds/tools.
// After: scope everything to a single call.
result, err := host.PromptResultWithOptions(ctx, prompt, kit.PromptOptions{
    Model:          "anthropic/claude-haiku-3-5-20241022",
    ThinkingLevel:  "low",
    ExtraTools:     []kit.Tool{lookupTool},
    ProviderAPIKey: tenantKey,
})
if errors.Is(err, kit.ErrContextOverflow) {
    host.Compact(ctx, nil, "") // classify instead of string-matching
}
if result.HaltedByTool == "finish" {
    answer := result.FinalValue // typed value carried out of a finish() tool
    _ = answer
}

Refs #68

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation update

What's included

Tier 1 — removes embedder workarounds / feature drops:

Add support for overriding model registry and capabilities #1 Guard the config.configPath package global with a sync.RWMutex and
add GetConfigPath; route all readers through it (fixes a -race data race
when concurrent kit.New() calls discover a .kit.yml).
AddMCPServer() creates MCPToolManager without inheriting OAuth AuthHandler and TokenStoreFactory #3 Extend PromptOptions with per-call Model, ThinkingLevel,
ExtraTools, ProviderURL, ProviderAPIKey overrides (scoped apply/restore,
serialized by a mutex) and add PromptResultWithOptions.
OAuth fails for remote MCP servers without dynamic client registration (e.g. GitHub) — no way to provide ClientID #4 Capture per-turn stream deltas into TurnResult.Stream ([]StreamEvent)
in emit order; clarify that PromptResult blocks until end-of-turn.

Tier 2 — simplify embedder code:

feat: open external $EDITOR for composing long prompts via ctrl+x e chord #5 ToolOutput.Halt / FinalValue, surfaced as TurnResult.HaltedByTool /
FinalValue so structured-result patterns no longer need a side-channel.
feat: highlight @file tokens in input with accent color #6 New errors.go with provider-error sentinels (ErrContextOverflow,
ErrRateLimit, ErrAuth, ErrProviderUnavailable, ErrInvalidRequest) plus
ClassifyProviderError, wired into the turn error path.
fix: error calling tool #8 NewRawTool — schema-driven untyped tool constructor for tools sourced
from JSON Schema (skill files, MCP catalogs).
OpenAI login does not become the default provider, and starting with --model openai/... fails until model is reselected in /model #9 Add CompactionEvent.Err and emit the event on the failure path too.
Shift+Enter does not insert a newline even though the UI hint says it should #10 Promote AppendBranchSummary onto the SessionManager interface with
ErrBranchSummaryNotSupported; drop the treeManagerAdapter type-assertion
fallback in CollapseBranch.

Tier 3 — papercuts:

/thinking incorrectly reports that GPT-5.4 does not support thinking/reasoning #11 CloseContext(ctx) for deadline-bounded shutdown (Close() delegates).
TUI feels visually noisy: user label, large prompt placeholder, and input help take too much space #12 ResetForTesting() behind the testing build tag.
Text selection in the TUI is awkward and makes copying partial output frustrating #14 LoadSkillsFromFS(fsys fs.FS, root string) — fs.FS-typed skill loader
sharing the parser with the path-based loaders.

Checklist

My code follows the style guidelines of this project (gofmt, go vet, golangci-lint)
I have performed a self-review of my own code
I have added tests that prove my changes are effective
New and existing unit tests pass locally (go test -race ./...)
I have updated the documentation (SDK README + docs site)

Additional Information

Backward compatibility: fully additive. PromptOptions/ToolOutput/
CompactionEvent gain new fields (zero value = prior behavior); Close() is
unchanged and now delegates to CloseContext.
Interface change: SessionManager gains AppendBranchSummary. The shipped
treeManagerAdapter implements it; custom managers should return
ErrBranchSummaryNotSupported if unsupported.
Tests added: internal/config/configpath_test.go (race),
internal/skills/skills_fs_test.go, pkg/kit/errors_test.go,
pkg/kit/rawtool_test.go, pkg/kit/turn_capture_test.go.
Docs updated: pkg/kit/README.md and the docs site (www/pages/sdk/*,
www/pages/advanced/json-output.md); npx tome build passes.
Deferred (need sign-off): fix: ToolRenderConfig BorderColor and Background fields are ignored #2 provider injection, feat(sdk): expose generation and provider params on Options #7 LLMUsage.Extra
(breaking — it's a type alias for the external fantasy.Usage), Ctrl+C exits the app instead of clearing the current input #13 Kit.Fork.

Summary by CodeRabbit

Release Notes

New Features
- Turn results now include an ordered streaming timeline (text, reasoning, and tool-call chunks), plus tool-driven halt metadata (FinalValue, HaltedByTool).
- Added schema-driven NewRawTool with JSON argument validation.
- Per-call prompt overrides via PromptOptions and PromptResultWithOptions.
- Added LoadSkillsFromFS for embedded skills discovery.
- Improved session branch summary support (AppendBranchSummary / CollapseBranch) with a dedicated “not supported” error sentinel.
Documentation
- Updated SDK guides for streaming, tool halting, per-call overrides, embedded skills, provider error classification, and graceful shutdown (CloseContext).

Implements 11 of the 14 items from the embedder SDK hardening request, all additive (no existing signature changes): - guard config.configPath global with a mutex; add GetConfigPath - extend PromptOptions with per-call model/thinking/tools/provider overrides and add PromptResultWithOptions - capture per-turn stream deltas in TurnResult.Stream ([]StreamEvent) - add ToolOutput.Halt/FinalValue, surfaced as TurnResult.HaltedByTool/ FinalValue for structured-result extraction - add provider-error sentinels + ClassifyProviderError, wired into the turn error path - add NewRawTool schema-driven untyped tool constructor - add CompactionEvent.Err and emit it on the compaction failure path - promote AppendBranchSummary onto SessionManager with ErrBranchSummaryNotSupported; drop the type-assertion fallback - add CloseContext for deadline-bounded shutdown - add ResetForTesting behind the testing build tag - add LoadSkillsFromFS (fs.FS-typed skill loader) Includes unit tests and docs-site coverage for each addition. Refs #68

mark-iii-labs-huly · 2026-06-18T12:38:22Z

Connected to Huly®: KIT-70

coderabbitai · 2026-06-18T12:38:29Z

Warning

Review limit reached

@ezynda3, we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 9 minutes and 48 seconds. Learn how PR review limits work.

Your organization has used up its prepaid credits, and credit purchases are no longer available. Enable the review add-on in the billing tab to keep reviews running — you're only billed for reviews past your plan's rate limits ($0.25/file).

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

To avoid repeated limits, reduce automatic review volume by pausing incremental auto-reviews earlier, using label-based review opt-in, excluding WIP or generated PR titles, or requesting reviews manually when the PR is ready. If your team needs uninterrupted high-volume reviews, an organization admin can enable usage-based credits.

🚦 How do rate limits work?

CodeRabbit enforces per-developer PR review limits for each organization. Most developers receive the normal plan refill rate.

For paid Pro and Pro+ PR reviews, CodeRabbit uses adaptive limits for sustained high-volume activity. When a developer's recent PR review activity reaches the 95th percentile or higher among CodeRabbit users, the refill rate gradually slows as usage increases. The highest same-day bursts are limited more strictly.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: a6e1b7b7-14cb-4ef9-9bd2-db12269b806b

📥 Commits

Reviewing files that changed from the base of the PR and between a3e2b9c and 2ff73ec.

📒 Files selected for processing (2)

pkg/kit/rawtool_test.go
pkg/kit/tools.go

📝 Walkthrough

Walkthrough

This PR adds fourteen scoped SDK capabilities: a concurrency-safe config path with GetConfigPath/SetConfigPath, fs.FS-backed skill loading, five provider error sentinels with ClassifyProviderError, structured tool halt (Halt/FinalValue) and NewRawTool with JSON-schema inputs, per-turn StreamEvent capture in TurnResult, per-call PromptOptions overrides with save/restore, CloseContext for deadline-bounded shutdown, SessionManager.AppendBranchSummary, CompactionEvent.Err failure telemetry, and Agent.GetExtraTools.

Changes

Kit SDK Feature Expansion

Layer / File(s)	Summary
Concurrency-safe config path and test reset `internal/config/config.go`, `internal/config/configpath_test.go`, `pkg/kit/testing.go`	Replaces bare `configPath` global with `sync.RWMutex`-guarded `GetConfigPath`/`SetConfigPath`; `FilepathOr` uses `GetConfigPath()`; concurrent-access test added; `ResetForTesting()` exposed under the `testing` build tag.
fs.FS skill loading (internal + kit wrapper) `internal/skills/skills.go`, `internal/skills/skills_fs_test.go`, `pkg/kit/skills.go`	Extracts `parseSkill(data, srcPath, storePath)` helper; adds `LoadSkillsFromFS` walking an `fs.FS` tree filtering `.md`/`.txt`/`SKILL.md`; wraps it at kit level; tests cover normal walk, nil FS, and field assertions.
Provider error sentinels and ClassifyProviderError `pkg/kit/errors.go`, `pkg/kit/errors_test.go`	Defines `ErrContextOverflow`, `ErrRateLimit`, `ErrAuth`, `ErrProviderUnavailable`, `ErrInvalidRequest`; `ClassifyProviderError` wraps via `sentinelError` (preserving original via `Unwrap`), is idempotent, and validated by table-driven and idempotency tests.
ToolOutput halt, haltHolder, NewRawTool, and streamCollector `pkg/kit/tools.go`, `pkg/kit/turn_capture.go`, `pkg/kit/rawtool_test.go`, `pkg/kit/turn_capture_test.go`	Adds `Halt`/`FinalValue` to `ToolOutput`; introduces `haltHolder` (first-halt-wins, mutex-protected) and `recordHalt` called from `NewTool`/`NewParallelTool`/`NewRawTool`; `NewRawTool` decodes JSON args into `map[string]any` with schema override; `streamCollector` accumulates `StreamEvent`s concurrently.
TurnResult streaming events and generate path wiring `pkg/kit/kit.go`	Extends `TurnResult` with `Stream []StreamEvent`, `FinalValue`, `HaltedByTool`; defines `StreamEventKind` constants and `StreamEvent` struct; `runTurn` attaches `streamCollector`+`haltHolder` to context; `generate` records text/reasoning/tool-call-chunk events; error path uses `ClassifyProviderError`; results drained into `TurnResult`.
Per-call PromptOptions overrides and CloseContext `pkg/kit/kit.go`, `internal/agent/agent.go`	Adds `Model`, `ThinkingLevel`, `ExtraTools`, `ProviderURL`, `ProviderAPIKey` to `PromptOptions`; `PromptResultWithOptions` applies/restores overrides serialized by `promptOptsMu`; `CloseContext(ctx)` added with context-error preference; `Close()` delegates to `CloseContext(context.Background())`; `Agent.GetExtraTools()` returns a copied snapshot.
SessionManager AppendBranchSummary and CollapseBranch refactor `pkg/kit/session.go`, `pkg/kit/sessions.go`, `pkg/kit/adapter.go`	Adds `ErrBranchSummaryNotSupported` sentinel and `AppendBranchSummary(fromID, summary)` to `SessionManager`; `treeManagerAdapter` delegates to inner; `CollapseBranch` removes type-assertion fallback and calls `m.session.AppendBranchSummary` directly.
CompactionEvent failure telemetry `pkg/kit/events.go`, `pkg/kit/compaction.go`	Adds `Err error` to `CompactionEvent`; `compactInternal` delegates to new `compactImpl` and emits `CompactionEvent{Err: err}` on the failure path, symmetrical with success path.
SDK documentation updates `pkg/kit/README.md`, `www/pages/sdk/overview.md`, `www/pages/sdk/callbacks.md`, `www/pages/sdk/sessions.md`, `www/pages/advanced/json-output.md`	Documents all new APIs: `TurnResult.Stream`, per-call `PromptOptions`, `NewRawTool`, `Halt`/`FinalValue`, `LoadSkillsFromFS`, error sentinels, `CloseContext`, `AppendBranchSummary`, and `CompactionEvent.Err` telemetry with examples.

Sequence Diagram(s)

sequenceDiagram
    participant Caller
    participant Kit as Kit.PromptResultWithOptions
    participant runTurn
    participant generate
    participant streamCollector
    participant haltHolder
    participant ClassifyProviderError

    rect rgba(100, 149, 237, 0.5)
        note over Kit,runTurn: Per-call override apply/restore (promptOptsMu)
        Caller->>Kit: PromptResultWithOptions(ctx, msg, opts)
        Kit->>Kit: applyPromptOptions(opts) — save state, set overrides
    end

    Kit->>runTurn: run turn
    runTurn->>runTurn: attach streamCollector + haltHolder to ctx
    runTurn->>generate: generate(ctx, ...)
    generate->>streamCollector: add StreamEvent (text/reasoning/tool chunk)
    generate->>haltHolder: recordHalt via tool callback
    generate-->>runTurn: done or error

    alt provider error
        runTurn->>ClassifyProviderError: classify(err)
        ClassifyProviderError-->>runTurn: sentinel-wrapped error
        runTurn-->>Kit: error
    else success
        runTurn->>streamCollector: drain() → TurnResult.Stream
        runTurn->>haltHolder: snapshot() → TurnResult.HaltedByTool, FinalValue
        runTurn-->>Kit: TurnResult
    end

    Kit->>Kit: restorePromptOptions() — restore saved state
    Kit-->>Caller: TurnResult, error

Estimated code review effort

🎯 5 (Critical) | ⏱️ ~120 minutes

Possibly related issues

feat: embedder SDK hardening — 14 scoped additions for higher-level framework authors #68: This PR directly implements all 14 scoped enhancements listed in that issue, including the configPath data race fix, per-call prompt overrides, StreamEvent capture, structured tool halting, provider error sentinels, NewRawTool, CompactionEvent failure emission, SessionManager AppendBranchSummary, CloseContext, ResetForTesting, and LoadSkillsFromFS.

Possibly related PRs

mark3labs/kit#42: Both PRs modify streaming behavior in pkg/kit/kit.go; this PR adds StreamEvent capture into TurnResult while that PR handles streaming configuration/viper isolation precedence.
mark3labs/kit#54: Both PRs extend extension/tool state wiring via turnAggregator/llmUsageMeta in pkg/kit/extensions_bridge.go and enrich TurnResult with usage/agent-end data.

Poem

🐇 Hoppity-hop through the stream of events,
Each delta and halt lands right where it's meant.
The config path locked, no more data race fright,
Raw tools and sentinels shine in the night.
FS skills embedded with go:embed flair—
The rabbit ships features beyond all compare! ✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 60.87% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'feat(sdk): harden pkg/kit embedder surface with scoped additions' accurately captures the PR's main objective of expanding and strengthening the pkg/kit SDK surface with backward-compatible additions to unblock embedder frameworks.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/68-embedder-sdk-hardening

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 8

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

pkg/kit/kit.go (1)

2359-2361: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Emit StreamEventReasoningStart on reasoning-start callbacks.

OnReasoningStart currently emits only ReasoningStartEvent; TurnResult.Stream misses the start marker on providers using reasoning callbacks (non-<think> path).

Proposed fix

 		OnReasoningStart: func(id string) {
 			m.events.emit(ReasoningStartEvent{ID: id})
+			collector.add(StreamEvent{Kind: StreamEventReasoningStart})
 		},

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/kit/kit.go` around lines 2359 - 2361, The OnReasoningStart callback
function currently only emits ReasoningStartEvent but needs to also emit
StreamEventReasoningStart to ensure TurnResult.Stream has the proper start
marker for providers using reasoning callbacks. In addition to the existing
m.events.emit call with ReasoningStartEvent, add another m.events.emit call to
emit StreamEventReasoningStart with the same ID parameter to properly mark the
start of the stream event on the non-think path.

🧹 Nitpick comments (3)

pkg/kit/errors_test.go (1)

37-41: ⚡ Quick win

Use identity check for “unchanged error” assertion.

For the unclassified path, comparing Error() text can hide wrapping/regression. Checking pointer identity verifies the contract directly.

✅ Suggested test tweak

-			if tc.want == nil {
-				// Unclassified errors are returned unchanged.
-				if got.Error() != tc.in.Error() {
-					t.Fatalf("expected unchanged error, got %v", got)
-				}
-				return
-			}
+			if tc.want == nil {
+				// Unclassified errors are returned unchanged.
+				if got != tc.in {
+					t.Fatalf("expected unchanged error instance, got %v", got)
+				}
+				return
+			}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/kit/errors_test.go` around lines 37 - 41, The unclassified error
assertion in the test is comparing error text representations using the Error()
method, which can mask wrapping or modifications to the error object. Replace
the comparison of Error() string values with a direct pointer identity check
comparing got and tc.in directly, so the test verifies that the exact same error
object is returned unchanged rather than just checking that the text
representation matches.

pkg/kit/rawtool_test.go (1)

58-72: ⚡ Quick win

Add a regression case for whitespace-null input.

Given NewRawTool accepts null as “no args”, add a case for " null " to lock consistent behavior and prevent nil-map regressions.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/kit/rawtool_test.go` around lines 58 - 72, Add a regression test case
within TestNewRawToolInvalidArgs or create a new test function to verify that
NewRawTool handles whitespace-padded null input (such as " null " with
surrounding spaces) consistently with plain null input. The test should call
tool.Run with this whitespace-null input and verify that the handler is not
invoked and the response indicates proper error handling, ensuring nil-map
parsing behavior remains consistent across different null input variations.

pkg/kit/kit.go (1)

2801-2802: ⚡ Quick win

Wrap returned errors with operation context.

These returns drop call-site context. Wrap with fmt.Errorf("...: %w", err) for debuggability and consistency.

Proposed fix

-			return nil, err
+			return nil, fmt.Errorf("set model with prompt overrides: %w", err)
@@
-		return nil, err
+		return nil, fmt.Errorf("apply prompt options: %w", err)

As per coding guidelines, Always check errors and wrap them with fmt.Errorf("context: %w", err).

Also applies to: 2843-2845

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/kit/kit.go` around lines 2801 - 2802, Error returns at lines 2801-2802
and 2843-2845 lack operation context for debugging. Wrap each bare error return
statement (where `return nil, err` or similar appears) with `fmt.Errorf` to add
descriptive context about what operation failed, using the pattern
`fmt.Errorf("operation description: %w", err)`. This applies to all error return
statements in the affected code sections to maintain consistency with project
coding guidelines.

Source: Coding guidelines

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@internal/skills/skills.go`:
- Around line 184-185: The errs variable declared at line 184 is a slice of
strings ([]string), and at lines 205, 210, and 220, errors are being converted
to strings before appending, which destroys the error chain and breaks
errors.Is/As functionality downstream. Change the errs variable from a slice of
strings to a slice of error types ([]error), preserve the actual error objects
when appending to this slice instead of converting them to strings, ensure
individual errors are wrapped with appropriate context using fmt.Errorf with %w
wrapping, and ensure the final aggregated error properly maintains the error
chain so downstream consumers can use errors.Is and errors.As correctly.

In `@pkg/kit/compaction.go`:
- Around line 119-124: The error handling block in the compactInternal function
does not wrap the error returned from m.compactImpl with context information.
Wrap the error using fmt.Errorf with an appropriate contextual message before
using it in both the CompactionEvent emission and the return statement. Use the
%w verb to preserve the original error while adding call-site context that will
aid debugging for subscribers and callers.

In `@pkg/kit/errors.go`:
- Around line 33-40: The documentation for the ClassifyProviderError function
claims "The original error remains reachable via errors.Unwrap", but this is
incorrect because the standard errors.Unwrap function only works with Unwrap()
error (returning a single error), not Unwrap() []error (returning multiple
errors). Verify the actual implementation of the error type returned by
ClassifyProviderError and ensure it properly implements the Unwrap method
according to the standard library contract. If the function returns an error
that implements Unwrap() []error, either change it to implement Unwrap() error
returning a single unwrapped error, or update the documentation to accurately
reflect what can be retrieved via errors.Unwrap.

In `@pkg/kit/kit.go`:
- Around line 2803-2808: The restore closure in the defer block calls
m.SetModel(ctx, prevModel) using the caller's context, which may be canceled or
expired by the time the defer executes, causing the error to be silently ignored
with an underscore. Replace the caller's context with a background context or a
new context that won't be canceled when calling m.SetModel in the restore
function, ensuring the rollback can complete successfully and prevent per-call
overrides from leaking into subsequent calls.
- Around line 2796-2800: Replace the direct m.v.Set() calls for the three
configuration keys (thinking-level, provider-url, and provider-api-key) with the
restoreViperString helper function to properly respect the prev*Set flags that
were saved at the start of the block. Instead of calling
m.v.Set("thinking-level", prevThinking), m.v.Set("provider-url", prevURL), and
m.v.Set("provider-api-key", prevKey) directly, use restoreViperString for each
key with its corresponding previous value and flag, matching the pattern already
used at lines 2803-2806 for the restore() operation.

In `@pkg/kit/sessions.go`:
- Around line 220-227: The CollapseBranch method in the Kit type accepts a toID
parameter that is not used in the implementation, since AppendBranchSummary
always collapses to the current leaf. Add godoc documentation to the
CollapseBranch function that clarifies the toID parameter is unused and
explicitly documents that the method always collapses branches to the current
leaf regardless of what toID value is passed, to prevent callers from being
confused about the actual behavior.

In `@pkg/kit/tools.go`:
- Around line 5-8: The null check comparing `string(input) != "null"` is brittle
because it doesn't handle whitespace variants like " null ", which can cause
json.Unmarshal to unexpectedly set args to nil. Update the null check logic by
using strings.TrimSpace to remove leading and trailing whitespace from the input
before comparing it to "null". Apply this fix to all occurrences of the brittle
null comparison, including the locations around the import section and the
also-applies-to range mentioned in the comment.

In `@www/pages/sdk/sessions.md`:
- Line 102: The text in the SessionManager documentation section contains an
incorrect reference labeled "SDK skill reference" that links to a repository and
appears outdated for this context. Replace the misleading "SDK skill reference"
link text and URL with an appropriate reference that correctly documents the
SessionManager interface definition, ensuring it accurately reflects the actual
documentation or interface definition location being referenced.

---

Outside diff comments:
In `@pkg/kit/kit.go`:
- Around line 2359-2361: The OnReasoningStart callback function currently only
emits ReasoningStartEvent but needs to also emit StreamEventReasoningStart to
ensure TurnResult.Stream has the proper start marker for providers using
reasoning callbacks. In addition to the existing m.events.emit call with
ReasoningStartEvent, add another m.events.emit call to emit
StreamEventReasoningStart with the same ID parameter to properly mark the start
of the stream event on the non-think path.

---

Nitpick comments:
In `@pkg/kit/errors_test.go`:
- Around line 37-41: The unclassified error assertion in the test is comparing
error text representations using the Error() method, which can mask wrapping or
modifications to the error object. Replace the comparison of Error() string
values with a direct pointer identity check comparing got and tc.in directly, so
the test verifies that the exact same error object is returned unchanged rather
than just checking that the text representation matches.

In `@pkg/kit/kit.go`:
- Around line 2801-2802: Error returns at lines 2801-2802 and 2843-2845 lack
operation context for debugging. Wrap each bare error return statement (where
`return nil, err` or similar appears) with `fmt.Errorf` to add descriptive
context about what operation failed, using the pattern `fmt.Errorf("operation
description: %w", err)`. This applies to all error return statements in the
affected code sections to maintain consistency with project coding guidelines.

In `@pkg/kit/rawtool_test.go`:
- Around line 58-72: Add a regression test case within TestNewRawToolInvalidArgs
or create a new test function to verify that NewRawTool handles
whitespace-padded null input (such as " null " with surrounding spaces)
consistently with plain null input. The test should call tool.Run with this
whitespace-null input and verify that the handler is not invoked and the
response indicates proper error handling, ensuring nil-map parsing behavior
remains consistent across different null input variations.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: cac61dfc-3076-466f-85e5-9a2f8d3370a6

📥 Commits

Reviewing files that changed from the base of the PR and between bd56f4a and dd44fad.

📒 Files selected for processing (24)

internal/agent/agent.go
internal/config/config.go
internal/config/configpath_test.go
internal/skills/skills.go
internal/skills/skills_fs_test.go
pkg/kit/README.md
pkg/kit/adapter.go
pkg/kit/compaction.go
pkg/kit/errors.go
pkg/kit/errors_test.go
pkg/kit/events.go
pkg/kit/kit.go
pkg/kit/rawtool_test.go
pkg/kit/session.go
pkg/kit/sessions.go
pkg/kit/skills.go
pkg/kit/testing.go
pkg/kit/tools.go
pkg/kit/turn_capture.go
pkg/kit/turn_capture_test.go
www/pages/advanced/json-output.md
www/pages/sdk/callbacks.md
www/pages/sdk/overview.md
www/pages/sdk/sessions.md

- skills: aggregate load errors as []error with errors.Join + %w so the chain stays inspectable via errors.Is/As (both LoadSkillsFromDir and LoadSkillsFromFS) - errors: correct ClassifyProviderError godoc — sentinelError implements Unwrap() []error, which errors.Unwrap does not traverse; reference errors.Is instead (also fixed in the docs site) - kit: use restoreViperString on the applyPromptOptions error path so previously-unset config keys are cleared, not forced to "" - kit: roll back the per-call model override with context.Background() so a canceled caller ctx can't leak the override into later calls - sessions: document that CollapseBranch's toID is unused and the branch always collapses to the current leaf - docs: retarget the SessionManager link to its pkg.go.dev definition Refs #68

Normalise the raw input with bytes.TrimSpace before the null/empty check so " null ", "\tnull\n", etc. follow the same skip-unmarshal path as the bare "null" and the handler always receives a non-nil empty map. Removes an implicit dependency on fantasy trimming the value during its RawMessage decode. Adds a contract test for the null-variant inputs. Refs #68

coderabbitai Bot reviewed Jun 18, 2026

View reviewed changes

ezynda3 added 2 commits June 18, 2026 17:22

ezynda3 merged commit dd7ae41 into master Jun 18, 2026
3 checks passed

This was referenced Jun 18, 2026

feat: embedder SDK hardening — deferred structural items (provider injection, LLMUsage.Extra, Kit.Fork) #70

Open

feat: embedder SDK hardening — 14 scoped additions for higher-level framework authors #68

Closed

coderabbitai Bot mentioned this pull request Jun 18, 2026

feat(skills): full agentskills.io spec compliance #71

Merged

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sdk): harden pkg/kit embedder surface with scoped additions#69

feat(sdk): harden pkg/kit embedder surface with scoped additions#69
ezynda3 merged 3 commits into
masterfrom
feat/68-embedder-sdk-hardening

ezynda3 commented Jun 18, 2026 •

edited by mark-iii-labs-huly Bot

Loading

Uh oh!

mark-iii-labs-huly Bot commented Jun 18, 2026

Uh oh!

coderabbitai Bot commented Jun 18, 2026 •

edited

Loading

Review limit reached

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related issues

Possibly related PRs

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ezynda3 commented Jun 18, 2026 • edited by mark-iii-labs-huly Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

What's included

Checklist

Additional Information

Summary by CodeRabbit

Release Notes

Uh oh!

mark-iii-labs-huly Bot commented Jun 18, 2026

Uh oh!

coderabbitai Bot commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related issues

Possibly related PRs

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ezynda3 commented Jun 18, 2026 •

edited by mark-iii-labs-huly Bot

Loading

coderabbitai Bot commented Jun 18, 2026 •

edited

Loading